Speaker Recognition Using Principal Component Analysis
نویسندگان
چکیده
This paper proposes a new feature vector— Mel Frequency Principal Coefficient(MFPC), applied to speaker recognition. It is derived by performing Principal Component Analysis on the Mel Scale Spectrum Vector. Compared with conventional Mel Frequency Cepstrum Coefficient, MFPC efficiently exploited the correlation information among different frequency channels. These correlations, which is mainly caused by the vocal tract resonance, have been found to vary consistently from one speaker to another. And we select these feature coefficients according to their Fisher Ratio, which will guarantee the largest discriminability between classes in the given dimensionality. Finally, we implement a textindependent speaker recognition system. It uses Vector Quantization to design codebooks of given reference speakers. The experiment results demonstrate that our proposed feature vector has characteristics of compactness, large discriminability and low redundancy.
منابع مشابه
Speaker recognition using supervised probabilistic principal component analysis
In this study, a supervised probabilistic principal component analysis (SPPCA) model is proposed in order to integrate the speaker label information into a factor analysis approach using the well-known probabilistic principal component analysis (PPCA) model under a support vector machine (SVM) framework. The latent factor from the proposed model is believed to be more discriminative than one fr...
متن کاملSpeaker recognition using MPEG-7 descriptors
Our purpose is to evaluate the efficiency of MPEG-7 audio descriptors for speaker recognition. The upcoming MPEG-7 standard provides audio feature descriptors, which are useful for many applications. One example application is a speaker recognition system, in which reduced-dimension log-spectral features based on MPEG-7 descriptors are used to train hidden Markov models for individual speakers....
متن کاملOn the Use of Gaussian M Speaker Variabili
Analysis and modeling of speaker variability is important to help understand in-depth inter-speaker variances and to enhance current speech/speaker recognition system. In this paper we introduce adapted Gaussian mixture model (GMM) based speaker representation for the task. Two powerful multivariate statistical analysis methods, principal component analysis (PCA) and independent component analy...
متن کاملUnderstanding Speaker Variability Using Correlation-based Principal Component Analysis Thesis Proposal
In this research, we study the relationship amongst speakers and diierent sounds in speech. We propose a new speaker normalization/adaptation model which incorporates correlations amongst phoneme classes, and explore the applications of the model. Using principal component analysis we construct a speaker space based on a speaker covariance matrix obtained from the training data. The speaker cov...
متن کاملACOUSTIC MODEL ADAPTATION FOR AUTOMATIC SPEECH RECOGNITION AND ANIMAL VOCALIZATION CLASSIFICATION by
ACOUSTIC MODEL ADAPTATION FOR AUTOMATIC SPEECH RECOGNITION AND ANIMAL VOCALIZATION CLASSIFICATION Jidong Tao, B.Eng., M.S. Marquette University, 2009 Automatic speech recognition (ASR) converts human speech to readable text. Acoustic model adaptation, also called speaker adaptation, is one of the most promising techniques in ASR for improving recognition accuracy. Adaptation works by tuning a g...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2001